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We consider the quantum analogue of the pattern matching problem, which consists of classifying 
a given unknown system according to certain predefined pattern classes. We address the problem 
of quantum template matching in which each pattern class d is represented by a known quantum 
state Qi called a template state, and our task is to find a template which optimally matches a 
. given unknown quantum state /. We set up a precise formulation of this problem in terms of 

' the optimal strategy for an associated quantum Bayesian inference problem. We then investigate 

CNj ^ various examples of quantum template matching for qubit systems, considering the effect of allowing 

a finite number of copies of the input state /. We compare quantum optimal matching strategies 
^ ' and semiclassical strategies and demonstrate an entanglement assisted enhancement of performance 

, in the general quantum optimal strategy. 

^ ■ PACS numbers;03.67.-a, 03.65. Bz, 89.70.+C 
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, Let us consider the following pattern matching problem. We have at our disposal a database of recorded persons' 

■ pictures, organized into different classes according to certain defined features. Now we are given a further person's 
picture and we want to determine to which class the person belongs. We scan the database by comparing the defined 

■ features of the given sample with those of the classes. If the patterns of the sample have a good enough matching 
' with those of a certain class, then we can say that the person is recognized, that is, that the person belongs to that 

pattern class. We would like to consider a similar problem in the quantum mechanical context. 

There are various kinds of approaches to this problem Given a pattern we wish to classify it relative to a 

predefined set of pattern classes in such a way that we maximize some suitable measure of matching. This could be 
^2 ' formulated, e.g., in terms of vector representations, that is, a given image is discretized on a mesh and the contents 
in each pixel are approximated by the value of some predetermined intensity levels, say f(i,j) for the (i, j)-th pixel of 
a two dimensional mesh. Thus a pattern is represented by a vector / — (/(1, 1), /(I, 2), ...)^. When an input sample 
/o is given, an intrinsic feature is first extracted from it by removing noise and adjusting its size. The resulting data, 
^ , say /, called the feature vector, should be less noisy, less redundant and more invariant under commonly encountered 
' variations and distortions. We want then to classify this / into a pattern class from a set of classes {Ci}, where each 
class Ci contains some similar patterns. Classification is usually made by evaluating a discriminant function Di(f) 
associated with the pattern class Ci, and is such that if the input sample is actually in the class j, the value Dj{f) 
must be the largest. One way to make the problem more tractable is to represent each class Ci by a typical pattern 
called a template vector gi and to deal only with this vector as the representative of the class Ci. This is the template 
matching problem. 

Now consider a similar problem in the quantum mechanical context. We are given a feature state |/), which is 
usually unknown, and a set of classes {Ci} and their associated template states \gi), known a priori. The problem 
of quantum template matching is to classify the state |/) according to the set {|(7i)}, that is, to pick up the template 
which best matches with |/). This is similar in some respects to quantum state discrimination and quantum state 
estimation. In quantum state discrimination, a discrete set of states {pi} and their a priori probabilities {pi} are 
given. The task is to decide which state is received. In quantum state estimation, one is to reconstruct a given 
unknown state p by estimating certain parameters. In both scenarios, one usually minimizes a certain Bayes cost such 
as a decision or estimation error by using prior knowledge about the states. In quantum template matching, although 
we deal with an unknown input state generally specified by continuous parameters, the purpose is not to estimate 
the input state or to discriminate among the input states themselves, but to assign the best matched template state 
from amongst given candidates. In this sense quantum template matching involves aspects of both state estimation 
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and discrimination: the unknown input states are generally parameterized by continuous parameters which we wish 
to characterize only up to some approximation given by the "closest" template state. Indeed direct state estimation 
or discrimination would provide a strategy for template matching (by comparing the classical information of the 
estimated state with the classical information of the template identities) but this is generally not optimal - we should 
attempt to best match a template without necessarily obtaining any further more detailed information about the 
identity of the input state itself. 

In this paper we will set up a precise formulation of this problem (in section ||) in terms of a suitable intuitive 
matching criterion. The template matching problem will then appear as a problem of determining the optimal strategy 
for an associated quantum Ba yesia n inference problem |^,^ . We will then consider some examples of template matching 
for qubit systems (in sections III and IV) in particular, considering the effect of allowing a finite number of identical 
copies of the input state |/) (of course in a classical context this makes no difference). We will compare the optimal 
strategy (allowing full use of entanglement across the space of all copies) with two semiclassical strategies: 

(a) applying only separate measurements on each copy and processing the outcomes to decide the best matching; 

(b) applying the optimal state estimation strategy lT^-|l^] (using a collective measurement on the product state of all 
copies) and then classically comparing the identity of the reconstructed state with that of the template states. 

We will see that the optimal (fully entangled) strategy is more efficient than either of these. Finally in section ^ we 
will summarize our results and describe some interesting further possible generalizations of the concept of quantum 
template matching. 



II. BAYESIAN FORMULATION OF TEMPLATE MATCHING 

The Bayesian formulation is based on an a priori knowledge about the inputs: the input feature state / is unknown 
but it is assumed that we know the a priori probability distribution P{f) for possible inputs. Each template state cji 
representing the class Ci is assumed to be completely known. In order to classify / into a class Cj , we need to introduce 
a score S{Cj\f) which provides a matching criterion. One reasonable choice for the score in template matching is that 
derived from the similarity criterion 




5(C,|/)^ Tr h V/sW/ , (1) 



which is just the fidelity between the input state / and the template state gi. If the template states are pure states 
\gj) then this is just the standard overlap {gj \ f \gj). Under the similarity criterion, we are to choose the template for 
which the state overlap with / is largest. The matching strategy is represented by a probability operator measure 
(POM) {n,}: 

n, =nJ>o, ^n, =/. (2) 

This should be designed by using the a priori knowledge on the input and the template states, and the conditional 
scores S{Cj\f). The performance of a matching strategy is measured by the average score defined as 

S^J2j dfS{C,\f)P{C,\J)P{f), (3) 

where P{Cj\f) = Tr (IIj/) is the conditional probability that we have the j-th outcome given the state /. The best 
strategy is the one that maximizes this average score. If we introduce the score operators 



W,^ J dfS{C,\f)P{f)f, (4) 

then Eq. (|| ) can be rewritten as 

5-5]Tr(M/A)- (5) 
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Thus the problem is to find the optimal POM {tlj} that maximizes S given the set of score operators {VFj}- This 
is a standard quantum Bayesian optimization problem and necessary and sufficient conditions for optimality are well 
known : 

(i) f = W^jftj is hermitian, 

(6) 

(ii) t^Wj > yj. 

Note that since S{Cj\f) is always non-negative, we have Wj > as operators. Hence our optimal template matching 
problem (of optimizing eq. (||)) reduces to a standard quantum state discrimination problem - of distinguishing 
the mixed states Wj / Tr Wj (the normalized score operators) taken with prior probabilities Pj = Tr Wj / Tr Wj . 
In general the optimal strategy is unknown but we will consider examples exhibiting symmetry in which optimal 
strategies can be given. 



III. BINARY TEMPLATE MATCHING OF A TWO STATE SYSTEM 



We begin with the simplest case of quantum template matching in which there are only two classes, and each class 
is described by a template state which is a known pure qubit state. Furthermore the input states |/) will be restricted 
to depend on only a single real feature parameter. By taking an appropriate qubit basis {If) , |J,)}, the binary template 
states can be represented in terms of real components 

9 6 

l5o) =cos-|T)+sin-U), (7a) 
\g,) =sin^|t)+cos^U), (7b) 

with a single real parameter 9 specifying the nonorthogonality between the templates. As for the input state |/), we 
will assume that its input distribution is the uniform probability density over the great circle on the Bloch sphere 
defined by the two template states. Thus we can write 

|/(0))=cos^|T)+sin^U), (8) 

where the a priori density of (j) is uniform, P{f) = P{4') — (27r)^^. We are now to decide which template is closest to 
the given \f{4>)) in the sense of the highest state overlap. We suppose further that we are given TV identical copies of 
the input feature state \F[(j))) — |/(0))®^ and the average score can be written as 



j=0 "'o 



(9) 



where F{(f>) = \F{(f>)) (F(0)|. Note that our score S{Cj\f) is still just | {f{(t>)\gj) the overlap for a single copy (i.e. 
we are establishing a relation between the input pattern \ f{(t>)) and the templates \gj)) but our POM Ilj operates on 
the full space of N copies. The full input system is described on the A'^ + 1 dimensional totally symmetric bosonic 
subspace of H®^, Hb as 



fe=0 



where {|fc)} is the occupation number basis of the J,-component. For example, in the case of iV = 3, the basis state 
1 2) reads 



|2)^ ( ? ) (ITU) + Un) + IUT)). (11) 



Our score operators are then given by 



3 



W,^^J^^dc^F{<p)\{f{c^)\g,)\'; j = 0, 1, (12) 

where each Wj has support on the N + l-dimensional subspace Hb and Eq. can be rewritten as 

1 

5(7V)-^Tr(Ty,n,). (13) 

j=o 

Without loss of generahty the matching strategy {Ho, Hi} is constructed on Hb- In the occupation number basis 
representation (using eqs. (ITol) and (p^)), the score operators are exphcitly given as 



mw,m^,l[ ^)(?) '^"''';^g"^r''"" l(A'-^-Ocos», + (A. + l)], (14, 



when k + I is even, and 



/Mwi/\ f N\ f N\{k + l)U(2N-k-l)U 



when fc + Z is odd, where 9q = 9 and di — n — ' 



A. Optimal template matching 



Now we consider the optimal strategy that satisfies the conditions of Eq. (g). In the present case of binary 
classification, the analysis is rather straightforward, as we are to maximize the following quantity 



S{N) = Tr(T4^ono)+ Tr(T4^ini) 



Tr(M^i 



Tr 



{Wo-Wi)no 



(16) 
(17) 



where the resolution of the identity Ho + Hi ~ I was used in the second equality. Since Tr(W^i) = 1/2, Ho should 



be taken to maximize Tr 



(Wo- VKi)no 



that is, it should be the projection onto the subspace corresponding to the 



positive eigenvalues of the operator Wq — Wi. From Eqs. (|TJ) and (|T^) we have that 

1)!! (iV- 



{k\ {Wo - Wi) \l) = 




N\ {k + l~iy.\i2N ~ 
I ) (2iV)! 



k-l 



k-l) 



(iV + 1) 



COS0, 



(18) 



when k + I \s even, and (fc| {Wq — Wi) \l) = otherwise. Although we have not succeeded in deriving an explicit 
analytic expression for the eigenvalues Afe of Wo — Wi , we introduce the diagonalizing operator P such that 



^•(W'o - Wi)P^ = \k) (fc| J COS0, 

\fc=0 / 



(19) 



where Aq > Ai > .... > \n- Since Wo — Wi is antisymmetric in the antidiagonal (i.e. (Wq — Wi)ki = —(Wo — 
Wi)i\f^k,N-i) the eigenvalues match up in ± pairs: Ajv = — Aq, Ajv-i = — Ai, and so on (and when N is even, 



A 



N/2 



0). The optimal strategy can then be constructed from the pair of projection operators Ho,!!! onto the 



subspaces of nonnegative and negative eigenvalues respectively, which can be written as: 



fio = ptnMVp^ nMv^ 



Hi = ptnMVp^ nMv= 



^ 2 ^ 



L-J 

L 2 ^ 



+ ■ 



\N) {N\ , 



(20a) 
(20b) 
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where \N/2\ is the integer part of N/2. The maximum average score can be finally written as 



1 



LfJ 



k=0 



(21) 



The expressions (^0|) for the POJM also provide an intuitively appealing interpretation of the matching strategy, 
which consists of two steps. The first step is the unitary operation P which is applied to the A^-product input state 
\F{4>)). The second step is the measurement of the transformed state FF(0)pt by the POM {fl^^,flf^}. This 
corresponds to a separate measurement in the {|T) j li)} basis on each input copy space, followed by majority voting 
on the outcomes. In other words, the transformation P prepares the optimal entangled state for the final measurement, 
and (n^^) provides the projection onto the t-majority (the J,-majority) bosonic subspace. Note that for the 

case of a single copy (i.e. N = 1) Wq — Wi is diagonal in the {|t) , basis so that P = I and the measurement in 
this basis is the optimal strategy. 

In a semiclassical strategy where the separable measurement {II^^,Il5^^} (corresponding to the optimal mea- 
surement on each separate copy) is directly applied without using the transformation P, the attained average score 
becomes, instead 



LfJ 

Smv{N) = l + tl 

fe=0 



N\ {2k- 1)\\{2N -2k- 1)!! [N - 2k) 



(27V)!! 



(iV + 1) 



cos6'. 



(22) 



Soft{N) and Smw{N) are compared numerically in Fig. ^ for the case of orthogonal templates (6 — Q). The effect 
of P can be seen to reduce the required number of sample copies to attain a prescribed level of the average score 
(the curve denoted by "+" corresponds to the strategy consisting of quantum state estimation and classical matching, 
which will be explained in the next subsection). 
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FIG. 1. The average score in the binary template matching as a function of the available number of copies of the input. 
Three strategies are compared. The black circles represent Sopt{N) for the optimal strategy, while the white circles represent 
Smv{N) for the strategy of separable measurement + majority voting. The plus correspond to Sest{N,N + 1, -fj) for the 
strategy of the optimal state estimation + classical matching (section III B). 



Let us illustrate the optimal matching strategy in the case where we use three sample copies. The operator to be 
diagonalized is 



/ 15 VS \ 

cos6> 3 -V3 

26 x/3 -3 

V -\/3 -15 / 



(23) 



The diagonalizing matrix is found as 



5 



p 



V 



where cos 7 = [(2\/2T + 9)/4v/2T]i/2 and sin 7 = [(2^21 



P(Wo - Wi)P^ 



cos9 



COS7 





sin7 







C0S7 





— sm7 


— sin7 





C0S7 


■ 


A 
U 


sin7 


u 


COS^ / 




/2T- 


Q"! /4i 


^ ^ -L \ . V V L1J.CJ.1 IJ-Oj V C- 


/V2T- 


f 3 











V 


/2T- 


3 










-V2T + 3 


V 







-V2T- 



(24) 



(25) 



and 



<5'opt(3) — - + 
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cos6'. 



(26) 



One possible circuit structure for the optimal classifier is shown in Fig. ^ The input state |-F(0)) is first transformed 
by P, and is then processed interactively with two ancillary qubits via two controUed-NOT and two controUed- 
controUed-NOT gates. These steps implement {ft^^, tlf^} as a measurement on a single qubit. By measuring the 
second ancillary qubit in the basis {|t) we can decide with the maximum average score that the best 

matched template is \go) (respectively I51)) when the output is If) (respectively ||)). 
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FIG. 2. A circuit realization of the optimal classifier in the case of three input samples. The three input samples are 
first transformed by P, and then are processed interactively with two ancillary qubits via two controUed-NOT and two con- 
troUed-controUed-NOT gates. In our notation © is the operation which interchanges ||) and and the filled circles are control 
lines, i.e. the operation © is applied iff all control lines are in state ||). Finally, the second ancillary qubit |o")^ is measured in 
the basis {|T), |J,)}, and the two possible outcomes imply respectively that the best matched template is \go) or |gii). 



B. Template matching by state estimation 



Another possible kind of semiclassical strategy based on the optimal state estimation of a qubit is also considered in 
Fig. ^. Quantum state estimation deals with how to evaluate unknown parameters of a quantum state as precisely as 
possible. This idea can be naturally applied to template matching: one can first perform a quantum state estimation 
to estimate the input feature state, and then compare this reconstructed state with the templates classically. Optimal 
state estimation of a qubit using N identically prepared states has been recently studied in Refs. [T^-[l^. In particular, 
discrete and finite element optimal POMs were found (lIQ, and they maximize the following score 

S{N)^J2t- r d^Tr{uJi^r^)\{f{mM\'. (27) 

where |/(0m)) is a reconstructed state after the state estimation. The assignment of a guessed state m \f{4'm)) 
is now also to be optimized. This strategy was already described in Ref. ]l^ , but we rephrase it here in a slightly 
different and more practical way according to the results of Ref. . For convenience of calculation we introduce a 
new basis {\vo) , \vi)} for which our great circle of feature states \f {(/>)) is the equator. We fix the basis vectors by 
requiring the templates to have the symmetrical form: 
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|5o) = ^ (e-*(^-^) \vo) + e'(i-i) \vi) ) , (28a) 
|5i> = ^ (e*(i-i) \vo) + e-'(f \v,) ) . (28b) 



and the circle of feature states may be taken to be 

!/(</.)) ^i=(e-^K)+e*^M). (29) 

where again, the parameter (f) is uniformly distributed over [0, 27r). 

Let us also introduce M states equally spaced on the Bloch great circle (which will define our POM): 

|/„(^)) = [e-'(*+^)|t.o>+e*(*+^)|i;i)] ; (m = 0, 1, M - 1) (30) 

Here we have introduced a phase factor ip which determines the position of these symmetrical states relative to the 
fixed positions of the template states. The corresponding A^-fold tensor product states are: 

\FM) - lU^r'' - E fc)e-'(^-^'^)(-+^) , (31) 

where is the symmetric bosonic basis for {|t;o) , |t;i)}. It can then be shown that the square root measurement 

{\fim) (Mmll based on the states ((/?))}, that is, 

CA/-1 \ ~i 

E ' (32) 

m=0 / 



and the associated guess {\fmi^))}, provides an optimal state estimation strategy when we take M > N |16|. 

Thus, by applying the POM {|/im((^))(/i,„((p)|}, the input feature state is optimally reconstructed as one of the 
|/m((p)). Then one can classically compare this reconstructed state with the templates and pick up the template state 
which has the largest overlap with the reconstructed state. The above strategy for state estimation is optimal for 
any choice of ip. Indeed since our input state distribution is uniform, the score in eq. ( p7| ) will be independent of tp. 
But when we compare the reconstructed state with the templates, the resulting average score of template matching 
will depend on tp due to the fixed positions of the templates, i.e. different state estimation strategies which each 
give the same best possible score will generally give different scores for template matching via our classical method, 
and we should choose the best from our set of optimal estimation strategies. To complete our semi-classical template 
matching procedure we categorize the |/„i((p))'s into two classes according to the template states, that is, condense the 
set {\ii,n{ip)) (urnif)]} into a two element POM {IIq ^"'"((^), Il5^^"'"((/3)}, whose elements indicate that the best matched 
template is jf^o) or j^i), respectively. 

For simplicity let us assume that M is even. Then by symmetry, it is enough to consider ip in the range [0, ^). 
The categorization boundary is determined by the condition 

\{9o\fM)\ ^ \{gi\fM)\ (33) 

The values of m with \{go\fm{'p))\ > \{9i\fm{<p))\ are categorized into the class of \go), and the others into that of 
\gi). Noting that 

l(5o|/™(^))P - \{9i\fM)\' = 2sin(^ + ^)sin(| - 6), (34) 
the binary categorization should then be 

M-2 

2 

nf ^(^) = Y] |/im('^)>(Mm (</')!, (35a) 



m— 
A/-1 

E 



nf^M= E \i^^i^))M^)l (35b) 



7 



The average score for this strategy is 



Tr 



Tr 



where 



Wo = 



1 



2N+2 



4 — n V / i.— n \ / ' 



fc=0 ^ ^ fc=0 

By a straightforward calculation we obtain 



(36a) 
(36b) 

= Wl (37) 



fc=0 



fc 



TTT 



<Sest{N,N + 1,—) 

1 COS0 

= 2 + 2^(iV + l)sin(^) 



(38a) 
(38b) 
(38c) 



The quantity 5est(^, + 1, is compared with the optimal score Sopt{N) and the one Smv{N) obtained by the 
separable measurement plus majority voting scheme in Fig. |l| (for 9 = 0). As it can be seen, the strategy using the 
optimal state estimation followed by classical matching can be close to optimal for the region of small A^, while as A^ 
increases it starts to deviate from the optimal one and becomes closer to the strategy of separable measurement plus 
majority voting. 

The three strategies are schematically summarized in Figs. The quantum optimal strategy (Fig. ^ is realized 

by a collective measurement on the state 1/)*^^ with binary outputs. This is made by dividing the state space Tis 
spanned by 1/)**^ into 2 parts according to the templates, and by successfully using entanglement effects in Hb- On 
the other hand, the separable measurement plus majority voting scheme shown in Fig. || does not take any advantage 
of the entanglement which could be drawn from the state 1/)'^''^. In the optimal state estimation plus classical 
matching strategy shown in Fig. the collective measurement first performed for estimating |/) also utilizes an 
entanglement effect. However, this is not the best way for binary classification. In fact, the optimal state estimation 
requires dividing the space Hb into at least A^ + 1 parts. As A^ increases, one has to rely much more on the classical 
procedure to categorize the outputs into two classes. This is the reason why this strategy becomes ineffective for 
larger A^. Intuitively any intermediate measurement prior to the final decision tends to degrade the total performance 
leading to a waste of input copies for a given average score level, so the process for the best binary classification should 
stay entirely in the quantum domain until the very final measurement. 
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FIG. 3. A scheme of the optimal classifier for N input samples which is a generalization of Fig. |2| The binary classification 
of interest would eventually be turned into the measurement of a single qubit in the basis {| |), | |)}. 
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FIG. 4. Schemes of the strategy for the separable measurement plus majority voting, (a) is the direct translation of the POM 
which includes measurements. But this can be translated into a measurement on a single ancillary qubit plus an additional 
circuit (majority voting circuit) beforehand as shown in (b). The majority voting circuit includes a series of C-NOT gates just 
as in Fig. Q. 
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FIG. 5. A scheme of the strategy for the optimal state estimation plus classical matching. The optimal state estimation is 
a collective measurement on A'^ identical copies of the sample state. By applying this, the input feature state is reconstructed. 
The output would be one of the A'^ + 1 candidates |/m). Then one can compare this reconstructed state with the templates 
classically. This is actually a categorization of \ fm) into two classes according to the template states. 



IV. MULTIPLE TEMPLATE MATCHING OF A TWO STATE SYSTEM 



In the previous section we have assumed a single feature parameter <j) and a minimum number (two) of templates. 
In this section we extend our model to allow for multiple template matching. Although binary template matching 
can be reduced to the diagonalization of the operator Wq — Wi , there is no such straightforward method to find the 
optimal strategy in general cases. To keep the model tractable, we assume that the input feature state is a general 
qubit state depending now on two parameters, 

|/) = e-^^cos^|T) + e^^sin^U). (39) 

with a uniform a priori distribution over the whole Bloch sphere. Furthermore we suppose that only one of the 
parameters relates to the desired feature of |/), for example, the angle parameter (f) around the az axis, while the 
(Tz component itself is of no interest. The template states corresponding to this feature are assumed to be M states 
uniformly distributed around the great circle in the x — y plane of the Bloch sphere, that is, 

^^(e-^'^in+e^'^li)); (m = 0, 1, M - 1). (40) 

As before we have N copies of the input state as 



\F) ^ ur' - 




and generate the score operators based on this and the templates as 
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w„ 



2-K 



— I dct> j desme\F){F\ \{f\9,n)Y 



2(7V + 1) 



i + E (e-^^ 1^ + + \k) {k + l\) 



k=0 



N + 2 



(42a) 
(42b) 



We then seek the strategy to find the template which best matches with the given | /) in such a way to maximize the 
average score 



M-l 



s^J2 Tr(w^™n„0. 



(43) 



m=0 



As noted in section || this optimal template problem is equivalent to the problem of optimal discrimination of the set 
of mixed states ^Wm taken with equal a priori probabilities Pm = 1/M. 

The score operators evidently have the same symmetry as the templates, that is, 



where 



N 



. (W-2fc)7r 

e-'^J— \k) (fc|. 



(44) 



(45) 



fc=0 



is a unitary representation of the group of integers modulo M on the N + 1 dimensional bosonic subspace of N 
qubits. Indeed it is just the product representation V = w'*^ where v is the operation of rotation of the one-qubit 
Bloch sphere by 2ti/M about the z-axis. Now it is known |^ that for any group covariant set of states, the state 
discrimination problem always has an optimal strategy that is similarly group covariant, i.e. there will be an optimal 
POM of the form lim — y^IIo^^™ and the optimality conditions reduce to |q| 



M-l 



(i') f = ^ y^WoAot^l''" is hermitian. 



m=0 



(46) 



(ii') r - Wo > 0. 



We have succeeded in deriving an optimal strategy only in the case that M > N, i.e. when the number of 
copies is less than the number of templates. This is again the square root measurement built from the templates 

\Gm 



l5m)'*^, that is, the set {!!,„ = with 



\lJ-n 



G 2 \Gm) , G — y ' \Gm) {Gn 



(47) 



In fact, by using the orthogonality relation 

M-l 



E ®^p(' 



m=0 



2m7r 



n) = MSnfl for - M < n < M, 



(48) 



(so G like V is diagonal in the |fc) basis and G^/^ commutes with V), we find that 



1 ^ 

Im™) = i>™Imo), k) = -^Ei^)- 



(49) 



k=Q 



The optimality of this POM can then be proved by checking the conditions (i') and (ii') directly as follows. From Eq. 
(p8|), we obtain 



2(iV+ 1) 



JV-l 



k=0 



N + 2 



(50) 
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and, consequently. 



k=0 



^-^0^ 2{N + l){N + 2) ^ ViN-k)ik + l) |fc)(fc| + |fc + l)(fc+l|-|fc>(fc + l|-|fc + l)(fc| . (51) 



Since each 2x2 matrix inside the brackets [...] in Eq. (|5^) has the eigenvalues and 2 and is non- negative definite, 
so also is r — Wo andfii') of Eq. ( ^6| ) holds. Condition (i') of Eq. ( ^6| ) can be checked in a straightforward manner 
from Eqs. (^9|) and (|5l|). The maximum average score does not depend on M and reads 



5.,„(«).MlVaW.i+g^^M^ (52, 

We also note that |Go) is the maximum-eigenvalue eigenstate of Wq, i.e. the spectral decomposition is 

^ k + 1 

Wo^}2iu,\u;k)M, ^fc= (jv + l)(iV + 2) ' 

with \Go) — I^jJn)- This is especially interesting in view of the following theorem proved in 

Theorem: Let G be a group and let g — > be an irreducible representation of G on a c? dimensional Hilbert space 
Ti.. Let {Fg : g e G} be a collection of Hermitian operators on Ti, such that Fg — VgFeV^ (where e is the identity of 
G). For any POM X — {Xg : g G G} consider the function 



QiX) = TryPgX^ 



Let Z = (01 where |0) is the maximum eigenvalue eigenstate of F^ (and |G| is the size of G). 

Then Q is maximized by the POM {VgZV^ : e G}. 

Note that Vg |0) is a maximum eigenvalue eigenstate of Fg so the theorem claims that the G covariant POM based 
on these |G| eigen-directions is optimal. By irreducibility of the representation we have (via Schur's lemma) that 
Eg ^g^^g is ^ multiple of the identity for any operator A. Thus the square root measurement construction docs not 
alter these maximal eigen-directions when the representation is irreducible. In our template matching problem G is 
the group of integers modulo M and Fg correspond to the score operators Wm- Ti, is the + 1 dimensional bosonic 
subspace of N qubits and the group acts via m V"^ . This representation is not irreducible so the theorem does not 
apply. Yet we have shown that an optimal measurement is still obtainable from the maximum eigenvalue eigenstates 
of the score operators. In this case (of a reducible representation) the square root construction will give a non-trivial 
change in the directions of the maximal eigenstates, necessary to obtain a POM from them. This suggests a possible 
avenue of generalization for the above theorem of Q which we will explore elsewhere. 

In the other case M < N, that is, when we can use a larger number of copies of the input than the number of 
templates, the optimal POM is more complicated. This should include elements with rank 2 or higher because of the 
requirement that X]m=o ^jn = / in the A'' -|- 1 dimensional bosonic subspace. We have not yet found a systematic 
way to construct such higher rank POMs. Here we discuss some simple cases. 

The simplest case is M = 2, that is, binary classification. In this case, the two score operators commute and the 
strategy of separable measurement in the binary template basis on each copy plus majority voting turns out to be 



n 



/ i a c \ 
^ b 



c 



optimal (note that the binary template problem in section [II had a different distribution of input states and the the 
two templates there were not required to be orthogonal). 

The next simplest case is M = N = 3. The optimal POM is specified by 

\0 c a \j 

with a — {\/2l + a/5)/24, c = (VSS — \/3)/24, and b — 6ac, and the maximum average score is 

Srns,^(N) = TrP = — . (55) 
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This lio is derived by solving the equations for the condition 
satisfying the condition (pq)-(ii'). Ho is a rank 2 operator 



Ira)-(i') directly and then by picking up the solution 



with A . = 0.964 and A_ = 0.370 and 



no = A+|A+)(A+| + A_|A_)(A_| 



|A+> = 0.995 1^3) +0.100 
|A_> = 0.979 1^2) + 0.204 |wo) 



(56) 



(57a) 
(57b) 



where \ujo) , \uji) , \uj2) , 1^3) are the eigenstates of Wq corresponding to eigenvalues in increasing order (Eq. (p3|)). 
Thus, although the main component of Hq comes from the maximum-eigenvalue eigenstate |w3) of Wq, the other 
eigenstates are also involved with appropriate weights. 

Finally we mention the case of M = 3 and = 4. The optimal POM is specified by 

/ 



Ho 



1 

3 


a 


c 


-1 




a 


1 

3 


b 1 


\l 8 





c 


b 


1 

3 


b 


c 


1 




b 


1 
3 


a 


8^ 





c 


a 





(58) 



with b = V29 + VloT/24, c 



with A+ = 1 and A_ = 2/3, 



67 



V 

V3)/(24V2), and a = 66c. The structure of Hq is again of the form 
flo = A+|A+)(A+| + A_|A_)(A_|, 



|A+) = 0.992 |cj4) + 0.115 \uj2) + 0.044 |wo) 
|A_) = 0.984 |w3) +0.178 , 



and the maximum average score is 



5max(iV) 



5 + 4a + 2^/6b 
10 ' 



(59) 



(60a) 
(60b) 



(61) 



Generally speaking it is more difficult to find analytic solutions for the Bayes optimal strategy for mixed states, 
and one has to rely on numerical methods. The above examples indicate that the largest eigenvalue eigenstates of the 
score operator Wq should play an essential role in constructing the optimal POM (and maximizing the score), while 
smaller eigenvalue eigenstates can be regarded as perturbative correction terms. This might be helpful for considering 
efficient numerical algorithms for finding the optimal POM. 



V. CONCLUDING REMARKS 



We have considered the problem of quantum template matching, which is to find the template state that best 
matches a given input feature state. The quality of matching was taken to be the standard overlap of quantum states. 
This question was formulated in the context of quantum Bayesian inference and it was seen to be equivalent to the 
optimal discrimination of certain mixed states given in terms of score operators, each defined for a specific template 
state and including all the a priori information about the input. 

In this paper, the simplest case of binary classification of a two state system with a single feature parameter was 
extensively studied. We constructed the optimal strategy in the + 1 dimensional bosonic state space Hb spanned 
by the tensor products |/)^^ of N identical copies of the input state. The optimal state estimation on |/)®^ followed 
by a classical matching process does not provide the best strategy, and there is a different optimal use of entanglement 
for this particular binary classification problem. In the case of multiple template matching, the problem becomes 
more difficult and we derived the optimal strategies in a few illustrative cases. 
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As mentioned in the introduction, the procedure of conventional pattern matching consists of feature extraction, 
calculation of the discriminant function and classification. In the quantum context, however, it is not clear how 
to model such processes without an associated loss of useful information. For instance, to eliminate features of no 
concern one might simply project the input state onto the subspace spanned by the relevant states with the features 



of interest. But we saw in section [II that a quantum measurement, that is, a projection of states, carried out before 



the final template decision, is generally detrimental to optimal performance. In this spirit, we dealt with the problem 
in the original Hilbert space without projecting the input states onto the subspace for the features of interest (section 
IV), and the whole process of quantum template matching was represented by a single POM. It is of course an open 
question to formulate quantum protocols in more physically comprehensive ways, e.g. involving a separate non-trivial 
feature enhancement process prior to classification, and to systematically derive optimal strategies for them. 

But even without such additional infrastructure (e.g. feature enhancement) our problem of template matching 
has some interesting generalizations related to the role of classical versus quantum information in the formulation. 
In our formulation we have assumed that the input states (such as |/)) are given as quantum information (i.e. 
unknown quantum states) whereas the template states (|gi)'s with known identities) are given as classical information. 
Furthermore our goal was to obtain the best template as classical information (i.e. knowledge of the identity of the 
best \gi)) via a suitable POM. The ingredients of this formulation can be relaxed in a variety of potentially interesting 
ways and here we mention two such ways: 

(a) Instead of knowing the identities of the template states we may merely be given only some finite number (K) of 
copies of each template (so our original formulation is equivalent to K = oo). One matching strategy would then 
be to apply state estimation to the sets of K copies and proceed as in our original formulation with the resulting 
estimated state identities. But this is unlikely to be optimal and we should consider a more fully quantum procedure 
which, for any input |/), identifies the best template class (still here as classical information) without attempting to 
obtain any further information about the identities of the template states themselves. 

(b) A second more intrinsically quantum mechanical formulation of template matching involves obtaining the answer 
(i.e. the best matching template) only as quantum information. In this scenario we have a known prior distribution 
of inputs {\fi) ;pi} and a known set of possible templates {l^j}}. Then given one (or more) copies of \fi) we want to 
design a quantum process (i.e. a completely positive trace preserving map acting on the input) that outputs (one copy 
of) a quantum state ai of the form \ fi) ai — J2jPij Idj) (ffil such that some suitable average score PiPij^Ul''') 
is maximized. Note that the formulation in our paper (of getting the best template as classical information) would 
provide one possible strategy since we can then construct the corresponding template state as a quantum state, but 
again, this would not be expected to be optimal since we produce a great deal of unwanted extra information in 
addition to the desired quantum output state. 

There are yet further possible avenues for generalizing the formulation of the template matching problem. One is 
to study pattern classification with other kinds of matching criteria than fidelity, which would be chosen according 
to some specific application or purpose. For example, according to the quantum Sanov theorem (e.g. summarized in 
section IV of |17 ) the quantum relative entropy S{f\\g) between two quantum states provides an index for estimating 
the probability that the states will not be distinguished on the basis of an arbitrary measurement on A'^ copies of 
the state. Thus the relative entropy provides an alternative, operationally intuitive, notion of "distance" between 
quantum states and we may consider maximizing the average relative entropy as our similarity criterion in template 
matching for some purposes. 

The above remarks and generalizations show that the problem of template matching introduced in this paper is just 
the beginning of a fruitful area for further study. The formulation adopted in the paper is perhaps the simplest, in 
that it is closely related to an existing body of results on quantum Bayesian estimation. But a study of possible hybrid 
quantum-classical generalizations along the lines suggested above would provide a natural setting for characterizing 
new properties, and a deeper understanding, of quantum information itself, and especially the ways it fails to accord 
with familiar properties of classical information. 
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